Appropriate Item Partition for Improving the Mining Performance

نویسندگان

  • Tzung-Pei Hong
  • Jheng-Nan Huang
  • Kawuu W. Lin
  • Wen-Yang Lin
چکیده

Along with the progress of information techniques and the increase of information need, some databases in the real world grow very quickly and their sizes become very huge. If the FP-Growth procedure is directly executed on these databases to mine association rules, the computer memory may not allow all nodes of a FP-tree generated from a huge database. In this paper, a sophisticated mining approach with a flexible partition of items is proposed to effectively derive association rules under the constraint of memory limitation. The experimental results show that the proposed approach can make the mining process under the memory limitation always feasible.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient Technique for Indexing Temporal Databases

Temporal databases added a new dimension to traditional transaction databases. This dimension is the life time of each item, i.e. exhibition period, starting from the partition when this item appears in the transaction database to the partition when this item no longer exists. Mining temporal association rules became very interesting topic in many applications nowadays. In this paper, an effici...

متن کامل

Prediction of effect of fine particle removal on efficiency of a spiral circuit by size-by-size partition curves

Partition curves are widely used to determine the spiral separator efficiency. In this work, the partition curves were used in order to investigate the particle transportation to concentrate and tailing streams. Simulation of fine particle removal using the size-by-size partition curves showed that the recovery of gangue particles to concentrate can decrease 8.7%. It also showed that the recove...

متن کامل

Use of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems

  One of the most famous methods for recommendation is user-based Collaborative Filtering (CF). This system compares active user’s items rating with historical rating records of other users to find similar users and recommending items which seems interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...

متن کامل

Frequent Pattern Mining using Candidate Generation approach with Single Scan of Database

Most of the algorithms for discovering association rules require multiple passes over the database resulting in a large number of disk reads and placing a huge burden on the I/O subsystem [1]. To reduce this bottleneck in case of large databases, a new association rule mining algorithm, which uses both the Partition and the Apriori approach for calculating the frequent item sets in a single pas...

متن کامل

Examining the Performance of Vertical Fragmentation using FP-MAX Algorithm

Today’s business Environment has an increasing need for consistent, scalable, reliable and accessible information which grows steadily.The purpose of this work is to analyse the performance of Vertical Fragmentation on large as well as small database such as educational database, data warehouses, medical databases. Vertical Fragmentation has an important impact in improving the performance of m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010